Skip to content

fix(docx): add regression test for #359 list run whitespace#362

Merged
Goldziher merged 1 commit intokreuzberg-dev:mainfrom
crishoj:fix/issue-359-docx-list-whitespace
Feb 6, 2026
Merged

fix(docx): add regression test for #359 list run whitespace#362
Goldziher merged 1 commit intokreuzberg-dev:mainfrom
crishoj:fix/issue-359-docx-list-whitespace

Conversation

@crishoj
Copy link
Contributor

@crishoj crishoj commented Feb 6, 2026

Adds a failing test for #359 — DOCX list items with multiple text runs lose whitespace between runs ("Sermocinationypsiliform" instead of "Sermocination ypsiliform").

Root cause is trim_text(true) in docx-lite's XML parser stripping whitespace from <w:t> elements. Upstream fix submitted: v-lawyer/docx-lite#1

This PR is ready to merge once the upstream fix lands and docx-lite is updated.

Separate text runs within a paragraph are concatenated without spaces.
Root cause is trim_text(true) in docx-lite's XML parser stripping
whitespace from <w:t> elements.

Upstream fix: v-lawyer/docx-lite#1

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@Goldziher
Copy link
Collaborator

thank you.

@Goldziher Goldziher marked this pull request as ready for review February 6, 2026 18:32
@Goldziher Goldziher merged commit fa532a0 into kreuzberg-dev:main Feb 6, 2026
39 of 56 checks passed
@Goldziher
Copy link
Collaborator

a fix is already i n place. i merged this.

Goldziher added a commit that referenced this pull request Feb 7, 2026
fix(docx): add regression test for #359 list run whitespace
Goldziher added a commit that referenced this pull request Feb 13, 2026
fix(docx): add regression test for #359 list run whitespace
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants